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Abstract 

We consider the following combinatorial question. Let 

S 0 C Sx C S 2 C ... C S m 

be nested sets, where #(5j) = i. A move consists of altering one of the sets S t , 
1 < i < m — l,ina manner so that the nested condition still holds and #(Si) is still i. 
Our goal is to find a sequence of moves that exhausts through all subsets of S m (other 
than the initial sets Si) with no repeats. We call this “Gray-coding through nested 
sets” because of the analogy with Frank Gray’s theory of exhausting through integers 
while altering only one bit at a time. Our main result is an efficient algorithm that 
solves this problem. As a byproduct, we produce new families of cyclic Gray codes 
through binary m-bit integers. 


1 Introduction 

The nested set problem can be stated as follows. Let 

So C Si C S 2 C . . . C S m 

be nested sets, where #(Sj) —%. A move consists of altering one of the sets S % , for 1 < i < m, 
while maintaining the conditions that Jfi(Si) = i and that the sets are nested. Our goal is to 
find a sequence of moves that exhausts through all subsets of S m (other than the initial sets 
Si) with no repeats. We call this “Gray-coding through nested sets” because of the analogy 
with Frank Gray’s theory on stepping through all m-bit integers by altering just one bit at a 
time. (For the theory of Gray codes see ^ or ^ Section 2.2.2].) The problem of Gray-coding 
through nested sets does not seem to have been considered before in the literature, but it fits 
naturally into the genre of “combinatorial Gray codes” that has been popularized by ffilbert 
S. Wilf, Carla Savage, F. Ruskey, and others. We quote from Savage’s survey article [6]: 
“The term combinatorial Gray code ... is now used to refer to any method for generating 
combinatorial objects so that successive objects differ in some prespecified, usually small, 
way.” Examples of such combinatorial objects include ^-element subsets of an n-element set 
HM, permutations, binary trees, or partitions. 


Given nested sets S t as above, define the difference sequence to be (g 3 , g 2 ,. .., q m ), where 
q, is the unique element of S t \ S t -\. The difference sequence gives enough information to 
determine all the sets S ). The only valid way to alter S t is to remove q t and replace it with 
qi- |_i, and this has the effect of transposing g, : and g;+i in the difference sequence. Then Si is 
replaced by the set {q 1: q 2 , ■ ■ ■, qi- 1 , g,:+i}- 

Since a move is completely determined by i, we can record a sequence of moves by listing 
the indices. That is, the sequence [i\, i 2 , f 3 , ■ ■ . ,iv] means to first alter .S’,,, then S i2 , etc. 
We call this a stepping sequence if every subset of S m (other than the initial sets) appears 
exactly once. In other words, a stepping sequence is a sequence of moves that solves the 
problem of Gray-coding through nested sets. Since there are 2 m subsets and to + 1 initial 
sets Si, a stepping sequence for m has length 2 m — to — 1. If Si is altered twice in a row, the 
set returns to its original value. This is not allowed since we are not supposed to generate a 
set that has been seen before. Thus, any two consecutive terms in a stepping sequence must 
be distinct. Also notice that i occurs in the stepping sequence exactly ( m ) — 1 times. 

We first consider small examples. For m = 1, we have So = 0 C Si = {gi}. There are no 
additional subsets of {gi} other than the initial sets So and S\, so the stepping sequence is 
empty. For m = 2, we begin with 

S'o = 0 C Si = {gi} C S 2 = {gi, g 2 }. 

The only subset of S 2 that has not been seen is {g 2 }. We need a single move, which is to 
alter Si, and the associated stepping sequence is [1]. 

For m = 3 we begin with 

S 0 = 0 C Si = {gi} C S 2 = {gi, g 2 } C S 3 = {gi,g 2 , g 3 }. 

One solution is as follows: change S 2 to {gi,g 3 }, change Si to {g 3 }, change S 2 to {g 2 ,g 3 }, 
then change Si to {g 2 }. This is codified by [2,1,2,1], The only other solution is to first 
change Si to {g 2 }, then change S 2 to {g 2 ,g 3 }, then change Si to {g 3 }, then change S 2 to 
{gi, g 3 }. This solution is codified by [1, 2,1, 2], 

Since the sets Si are nested, the complements S m \ Si are also nested, but with the 
inclusions going in the reverse direction. When we alter S), we also alter its complement. 
Further, if the Si s run through every possible subset, then so do the complements. The 
complement of S) has order m — i. Thus, if [i\, ..., i/v] is a stepping sequence for m, then so 
is [to — ii, ..., to — ijv]- We call this the complementary stepping sequence, or more simply, 
the complement. In particular, for the order-3 example above, the complement of [2,1, 2,1] 
is [1,2,1,2], 

In addition, the reverse of a stepping sequence is a stepping sequence. This can be 
seen by running a stepping sequence backwards (beginning with the final set values S') and 
noting that one finishes with the sets S t , and in the intermediate steps one passes through 
all remaining subsets of S m . The stepping sequence [2,1,2,1] has the property that its 
complement is equal to its reverse, but not all stepping sequences have that property. For 
to = 4, there are exactly 34 stepping sequences, and ten of these have the property that the 
complement is equal to the reverse. 



In this article, we present four ways to create stepping sequences: by recursion, by 
greed, with a for-loop, and with a different for-loop. It turns out that all four methods are 
equivalent, he., they give rise to the same stepping sequence. In the final section, we show 
how the nested set problem is related to other combinatorial Gray codes, and we pose some 
open problems. 


2 The recursive method 

The following theorem enables us to build stepping sequences recursively. 

Theorem 2.1. If [ft...., Ik] and [i \...., i' K \ are stepping sequences for m — 1 (so K = 
2 m ~ 1 — m), then 

[A + 1, i 2 + 1,.. ■, ik + 1] U [1, 2,..., m - 1] U [ft, i ’ 2 ,..., 4] (1) 

and 

[ft, * 2 ,..., ik] U [m - 1,..., 1] U [i[ + l,i’ 2 +1,... ,i’ K + 1] (2) 

are stepping sequences for m. 

Proof. We first prove that ([I]) is a stepping sequence for m. Suppose initially Si = {a}. Let 
S' = S* \ {a} for i — 1,..., m — 1. If we applied [ft, • • •, iff to S', we would obtain every 
subset of S' m , except for the initial sets S'. By doing the same sequence of moves, but on the 
sets Si = S' U {a} instead of the sets S', we run through every subset of S m that contains 
a, except for the initial sets Si,...,S m . Since |Sj| = |S'| + 1, this sequence of moves is 
codified as [ft + 1,..., ix + 1]- At this point we have seen exactly once every subset of S m 
that contains a, and no other nonempty subset, and the difference sequence has the form 
(a, ft, ft,..., q m - i)- Note that {ft, ft,..., q m - 1 } = S m \ {a}. 

Now we have nested sets 

S 0 = 0, Si = {a}, S 2 = {a,ft}, S 3 = {a, ft, ft}, ..., S m = {a, ft,..., q m -i}- 

The moves [1, 2,..., m — 1] change Si to {ft}, then change S 2 to {ft, ft}, then change S 3 
to {ft, ft, ft}, and so on. All these subsets are new, because they do not contain a. After 
these moves, we have Si = {ft, ft,..., ft} for i < m. In particular, S m _i = {ft,..., q m -i} = 

S m \ {a}- 

Now applying the moves [z' 1; z' 2 ,..., i' K \ to the nested sets S 0 , Si,..., S m _i produces all 
remaining subsets of S m _i, i.e., all remaining subsets of S m that do not contain a. 

In summary, after applying the moves [zi + 1,..., %k + 1] we have seen all sets that contain 
a exactly once. The moves [1, 2,..., m — 1] produce new sets Si = {ft}, S 2 = {ft, g 2 }, ..., 
S m _i = {ft,..., q m -i} that do not contain a. Finally, the moves [ft, z 2 ,..., ix] produce all 
remaining subsets that do not contain a, and each set is produced exactly once. Thus all 
subsets of S m are produced exactly once, proving that ([I]) is indeed a stepping sequence 
for m. 


By applying ([I]) to the complements [m 1 — i \,..., m — 1 — ix\ and — — 

1 — i ' K \, we see that 

[m — *i, m — i 2 ,.. ■, m — ix\ U [1, 2,..., m — 1] U [m — 1 — 2^, m — 1 — i ' 2 ,..., m — 1 — i' K ] 

is a stepping sequence for m. Taking the complement, we find that ([2]) is also a stepping 
sequence for m. □ 

The stepping sequences for m that are produced in the theorem have length 2K+m— 1 = 
2(2 m ~ 1 — (m — 1) — 1) + m — 1 — 2 m — m — 1, as expected. 

The theorem enables one to build stepping sequences for m out of smaller stepping 
sequences. In particular, we define: 

R 2 = [1], and R m = (R m -i + 1) U [1, 2,..., m - 1] U R m -i for m = 3,4,..., (3) 

where R m -1 + 1 is obtained by adding one to each element of R m _ 1 . Then R m is a stepping 
sequence for m by Theorem 2.1. The first few are given by: 

R2 = [1] 

R 3 = [2] U [1, 2] U [1] = [2,1, 2,1] 

Ra = [3, 2, 3, 2] U [1, 2, 3] U [2,1,2,1] = [3,2, 3, 2,1,2, 3, 2,1,2,1] 

R 5 = [4,3,4,3,2,3,4,3,2,3,2,1,2,3,4,3,2,3,2,1,2,3,2,1,2,1], 

Figure 1 below illustrates nested-set generation for the stepping sequence i? 4 when S t = 
{1,2 ,... ,i}. The stepping sequence is listed in the first column, the difference sequence in 
the second column, and the newly generated sets in the remaining columns. Notice that the 
sets containing 1 are generated first, then the sets not containing 1. 


Lemma 2.2. The reverse of R m is equal to its complement. 

Proof. We must show that the sum of the i-th element of R m and the z-tli element of its 
reverse is always equal to m. This is true for R 2 . Assume inductively that it is true for 
R m - 1 - Let K’ = #(R m - 1 ). For i = 1,..., K', the sum of the i-th element of R m and the 
f-th element of its reverse is one more than the sum of the z-th element of R m - 1 and the 
z-th element of its reverse. By induction, this sum is 1 + (?n — 1) = m. The middle part 
of R m is [1..., m — 1], and this also has the property that the sum of the i-th element of 
[1,..., m — 1] and the 2 -th element of its reverse is m. This completes the induction and the 
proof. □ 

One might ask the question: Does the recursive method generate all stepping sequences? 
The answer is “no”. For m = 4, we exhausted through all sequences in {1, 2, 3} 11 and found 
that exactly 34 of these were stepping sequences. Theorem 2.1 produces 


[3, 2, 3, 2,1, 2, 3, 2,1, 2,1], [2, 3, 2, 3,1, 2, 3,1, 2,1, 2] 


Stepping 

Sequence 

Difference 

Sequence 

-Si 

s 2 

S 3 

s 4 

Initial 

(1,2, 3,4) 

{1} 

{1,2} 

{1,2,3} 

{1,2,3,4} 

3 

(1,2,4, 3) 



{1,2,4} 


2 

(1,4, 2, 3) 


{1,4} 



3 

(1,4, 3, 2) 



{1,3,4} 


2 

(1,3,4, 2) 


{1.3} 



1 

(3,1,4,2) 

{3} 




2 

(3,4,1,2) 


{3,4} 



3 

(3,4, 2,1) 



{2,3,4} 


2 

(3, 2,4,1) 


{2,3} 



1 

(2, 3,4,1) 

{2} 




2 

(2,4, 3,1) 


{2,4} 



1 

(4, 2, 3,1) 

{4} 





Figure 1: Nested set generation for f? 4 when Si — {1,2,, i} 


[3, 2, 3, 2,1, 2, 3,1, 2,1, 2], [2, 3, 2, 3,1, 2, 3, 2,1, 2,1], 


and their reverses, which accounts for 8 of the 34 stepping sequences. Observe that if 
two consecutive moves differ by at least two, then one could do the moves in reversed 
order, and the only effect would be that the sets generated on those two moves would be 
interchanged. In particular, if [A,..., ix\ is a stepping sequence and if \ij — ij+i\ > 1, 
then the sequence obtained by transposing ij and ij + 1 is also a stepping sequence. We will 
say that these are related by commutation. Beginning with the eight stepping sequences 
coming from Theorem 2.1, we can produce 10 others that are related by commutation. 
The remaining 16 stepping sequences do not seem to arise from Theorem 2.1. These are 
[2,1, 2,3, 2, 3,1, 2, 3, 2,1] and [2, 3,1, 2, 3, 2,1, 2, 3,1, 2], together with the stepping sequences 
that can be derived from these two using reverse, complement, and commutation. 


3 The greedy method 

Consider again the nested set problem. A greedy person wants to alter the set of largest 
possible cardinality, subject to the condition that the altered set has not been seen before. 
We show that the sequence [A ,i 2 , ...] that is produced in this manner is the stepping sequence 
R m that is defined in ([3]). For example, for m — 4 the reader can check that the greedy 
method produces the sequence [3, 2, 3, 2,1, 2, 3, 2,1, 2,1], which is the same as f? 4 - 

If So, Si, ■ ■ ■, S m are nested sets with \Si\ = i, then for i — 1,..., m — 1 let S* denote the 
alteration of Si, i.e., 

s* = Si^ U(S i+1 \ Si). 

(Note that C S* C S i+ 4 , so one can replace S) by S* while maintaining the nested set 
condition.) The greedy method can formally be stated as follows. In the pseudocode below, 









S contains the sets that have been seen before, and J contains the indices i such that S* has 
not been seen before. 


G:=[] 

S:= {S lr ..,S m } 

J := {1,2, 1} 

while J ^ 0 do { 
j : = max J 
G := GU [j] 

Sj := S* 

S:=S U {Sj} 

J := {i G [1,.. .,m- 1] : S* (£ S} 

} 

return G 


Figure 2: Greedy method for nested sets 


Theorem 3.1. The greedy method produces G = R m . 

Proof. The theorem is easily seen to hold for m = 2, because the greedy sequence is [1] and 
i ?2 = [1]. Let m > 3. We will make an inductive hypothesis that the theorem is true for 
m — 1, and we will show it is true for m. Recall 

R)n = (Rm -1 + 1) U [1, . . . , m — 1] U Rm—l ■ 

Suppose initially Si = {a}. The greedy method avoids selecting i — 1 for as long as possible, 
since 1 is the smallest possible index. Thus, Si is unchanged for the first part of the greedy 
method, and at this stage all elements of S contain a. Thus effectively, the greedy algorithm is 
working with 57,,.., S rn , always selecting the maximal index such that S* has not been seen 
before. By the inductive hypothesis, R m - i, when applied to the nested sets S( = Si — {a}, is 
applying the exact same greedy strategy for determining its next index. By throwing a back 
into these sets, we find that R m -i + 1 is selecting according to the greedy strategy. Thus the 
first K elements of G are R m -i + 1, where K = ff(R m _ i). 

At this point S contains all subsets of S m that contain a. Let q t be the unique element 
in 5{ + i \ Si. Then S, = {a, qi, ..., cp-i} for i — 0,1,..., m — 1. Since S* contains Sj- 1 , we 
see that a G S* for all j > 1, so S* G S. On the other hand, 57 = (gi} is not in S, since 
a S*. Thus the greedy method selects 1. Now 57 contains a, so S* contains a for all j > 2. 
On the other hand, 57 = {# 1 ,( 72 } is new. Thus, the greedy method selects 2 as the next 
index. Continuing in this way, we see that the greedy algorithm selects 3,4,..., m — 1. So 
the greedy method continues to agree with R rn . 

At this stage, Si = {# 1 }, S 2 = {# 1 , # 2 }, S 3 = {# 1 , q 2 , #3}, • • •, 5' m _ 1 = {# 1; ..., q m -i}, and 


S = {ScS m :aeS}U{Si,S 2 ,..., 5' m _ 1 }. 




Note that SVn-i = S m \ {a}. Consequently, 

S qL S •<=>■ S C S m -1 and S ^ Si for i — 1,..., m — 1 

The third part of R m is R m -i, applied to S 0 , Si,..., S rn _\. By the inductive hypothesis, 
Rm-i follows the greedy route of selecting the largest index such that S* has not been seen. 
This completes the induction and proves the result. □ 

The greedy person has a humble cousin who always decides to alter the set of smallest 
possible cardinality, subject to the set never having been seen before. 

Lemma 3.2. The reverse of the greedy sequence is the humble sequence. 

Proof. The greedy strategy applied to the complements of S t results in the humble strategy 
applied to Si, because the complement of S t is largest when Si is smallest. Thus, the 
complement of the greedy sequence G is the humble sequence H. By Theorem 3.1, G = R m . 
By Lemma 2.2, H = complement of G = reverse of G. □ 


4 A for-loop method 

We showed that R. m can be generated with a recursion or with a greedy algorithm. Now we 
present a third way to generate R m . If c is an integer, define 02 (c) to be the largest integer v 
such that 2 V divides c - this is called the valuation of c at 2. Consider the following program. 

G:=[] 

for c = 1 to 2 m 1 — 1 do { 

Let v = v 2 (c) (i.e., 2 1 j|c) 

Let h be the Hamming weight of c 
d := m — v — h 
G —— G U [c/, d T 1, • • •, d T u] 

} 

return G 


Figure 3: For-c loop method to generate R m 


Theorem 4.1. The sequence G that is returned by the above program is equal to R. rn . 
Proof. We use induction on m. Recall the formula 

Rm = (Rm- 1 + 1) U [1, 2, . . . , m — 1] U Rm- 1 - 


The induction hypothesis shows that R m -\ + 1 is generated with the code: 



1 do { 


G:=[] 

for c = 1 to 2 m 2 — 

Let v = V 2 (c) 

Let h be the Hamming weight of c 
dl (m — 1) — v — h 

G := G U [<£ + 1, (df + 1) + 1,..., (df + v ) + 1] 

} 

By substituting d — d! + 1, this is equivalent to 

G:=[] 

for c = 1 to 2 m 2 — 1 do { 

Let u = V 2 (c) 

Let h be the Hamming weight of c 
d m — v — h 
G := G U [d, d T 1,..., d u] 

} 

Thus, the for loop of the original program with c running from 1 to 2 m-2 — 1 yields R m - 1 +1. 

Next, observe that the for-loop iteration with c = 2 m_2 sets v — m — 2, h — 1, d — 1, 
and G:=GU [l,2,...,m - 1]. So the for-loop from c = 1 to 2 m-2 yields + 1) U 

[1,2,..., m — 1]. 

Finally, the inductive hypothesis implies that the final sequence R m - 1 can be generated 
with the code 

for d — 1 to 2 m_2 — 1 do { 

Let v = u 2 (c') 

Let h' be the Hamming weight of d 
d := (m — 1) — v — h! 

G G U [d, d+ 1,... ,d + v] 

} 

return G 

By setting c = 2 m ~ 2 + d and h — h' + 1, this can be rewritten as 

for c = 2 m-2 + 1 to 2 m_1 - 1 do { 

Let v = V 2 (c) 

Let h be the Hamming weight of c 
d m — v — h 
G := G U [d, d T 1,. ■ ■, d T u] 

} 

Putting these together, we conclude that R m can be generated with the for-loop running 
from c = 1 to 2 m_1 — 1. This completes the induction and the proof. □ 

As an example, when m = 4 the for-c loop generates the following: 


[3] U [2 3] U [2] U [1 2 3] U [2] U [1 2] U [1] 


c= 1 


c=2 


c=3 


c=4 


c=5 


c=6 


c=7 



This is exactly i? 4 . 

The advantage of the for-loop method of generating R m , as compared to the recursive 
formula (151) . is that it requires less memory. 


5 A fourth way to generate R m 

So far we have given three ways to generate R m : by a recursion, by the greedy method, and 
with a for loop. Here we give a fourth method, also using a for loop. Consider the following 
code for m > 2. 

G := [m — 1] 
t := m — 2 

for j = 1 to 2 m_2 — 1 do 
let v = v 2 (j) 

G G U [t, t + 1, t + 2,..., t + v + 1] U [t + v] 
t +:= v — 1 
return G 


Figure 4: For-j loop method to generate R m 


Theorem 5.1. The above for loop generates G = R m . Also, at the end of the for loop, 
t = 0. 

Proof. We use induction on m. The theorem is easily seen to be true for m — 2, because the 
for loop is empty in that case. Now let m > 3, and assume the theorem is true for m — 1. 
We will prove it is true for m also. The induction hypothesis implies that at the end of the 
following for loop we will have G = R m - 1 and t = 0: 

G := [m — 2] 
t := m — 3 

for j = 1 to 2 m-3 — 1 do 
let v = v 2 (j ) 

G G U [t, t + 1, t + 2,..., t + v + 1] U [t + v] 
t +:= v — 1 
return G 

Since R m = (R m - i + 1) U [1,..., m] U R m - i, the induction hypothesis implies that R m 
can be generated as follows. 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 
11 
12 

13 

14 


G := [m — 1] 
t! := m — 3 

for j = 1 to 2 m_3 — 1 do 
let v = v 2 (j) 

G := G U [If + 1, t! + 2,..., t' + v + 2] U [t' + v + 1] 
t' +:= v — 1 

G:=GU[l,2,...,ra-l] 

G := G U [m — 2] 
t := m — 3 

for j = 1 to 2 m_3 — 1 do 
let v = v 2 (j) 

G \= G U [t, t + 1, t + 2,..., t + v + 1] U [t + v] 
t +:= v — 1 
return G 


Moreover, at line 7 we know that t! — 0 and at line 14 we know t — 0. In lines 1-6, set 
t = + 1: 


1 

2 

3 

4 

5 

6 


G := [to- — 1] 
t := m — 2 

for j = 1 to 2 m-3 — 1 do 
let v = v 2 (j) 

G !— G U [i, t 1,..., t 4“ v l] U [f -I - u] 
t +:= v — 1 


Then at line 7 we know t — 1. Lines 7, 8, and 9 are equivalent to 

j : = 2 m-3 

Let v = v 2 (j) (he., v — m — 3) 

G := G U [t, t + 1, ..., t + v + 1] U [f + v] 
t +:= v — 1 


We recognize this as the iteration of the for loop with j = 2 m ^ 3 . Finally, in the last for loop 
we can change j to j + 2 m_3 without affecting the value for v. Equivalently, line 10 can be 
changed to “for j = 2 m ~ 3 + 1 to 2 m ~ 2 — 1 do”. This shows that R m can be produced by 
the above for-loop with j running from 1 to 2 m ~ 2 — 1. This completes the induction and the 
proof. □ 


We remark that the two for-loop methods for generating R rn that are given in Figures [3] 
and [4] are closely related, as shown in the next lemma. 

Lemma 5.2. The update for G in the for loop of Figure^ is equivalent to the updates when 
c = 2j and c = 2j + 1 in the for loop of Figure 0 




For example, when m — 4 the for-c loop of Theorem 4.1 and the for-j loop of Theorem 5.1 
generate i? 4 as follows: 


[3] U [2 3] U [2] U [1 2 3] U [2] U [1 2] U [1] 



Proof. Let Vj denote the valuation of v at 2, (he., 2 Vj maximally divides j. Let tj denote the 
value for t at the point in the for -j loop when G is being updated. Let d c denote the value 
for d in the for-c loop, namely d c = m — v c — HW(c). The for-j loop does the update 


G :— G U \tj , tj T 1, ..., tj T Vj T 1] U [tj T Vj ] 


(4) 


and the for-c loop does the update 


G G U [d c , d c + 1,..., d c + v c ]. 
We claim that when c = 2 j then 


[dc, • • •, d c T n c ] \tj ,..., tj Vj T 1] 

and when c = 2j + 1 then 

[d c , ...,d c + v c } = [tj + Vj], 

The claim will imply that the update (j3J) in the for -j loop is equivalent to the two updates 
with c = 2 j and c = 2j + 1 in the for-c loop. To see the claim, it suffices to show that 
d r 2j = tj, V2j = Vj + 1 , d‘2j +1 = tj + Vj, and v 2 j+i = 0 . The statements V2j = v 3 + 1 and 
v 2 j+i = 0 are obvious, so we just need to prove 

d 2 j = tj and d 2 j+i — d 2 j = Vj. 

Looking at the for-j loop program, we see that t starts at m — 2 and then Vj — 1 is added to 
t at the end of each for-loop iteration. Thus, 

tj + Vj — 1 = m — 2 + (vi — 1 ). 

0<i<j 

Lemma 5.3 below shows that the right-hand side is m — 2 — HW(j), and so 

tj=m- 2- HW(jj - Vj + l = 7n-l- HWi (j) - Vj. 

On the other hand, d^ = m — v-ij — HW(2j) = m — Vj — 1 — HW(j), and so we see that 
d 2 j = tj as claimed. Next, 

d 2 j +i - d 2j = (m - v 2j +i - HW(2 j + 1)) - (m - v 2j - HW(2j)) 

= v 2j + HW(2j) - HW(2j + 1) = v 2j - 1 = Vj. 


This proves the claim and establishes the lemma. 


□ 


Lemma 5.3. If i > 0, let Vi denote the valuation of i at 2 (i.e., 2 Vi maximally divides i), 
and let HW(i) denote the Hamming weight of i. Then 

i 

5^(1 — Vi) = HWjj). 

i=l 


Proof. This is trivially true when j = 1. Now let j > 1. Assuming the lemma is true for j, 
we will show it is true for j + 1. By the inductive hypothesis, 

j+i 

- Vi ) = HW(j) + 1 - v j+1 

i =1 

and we must show that this is equal to HW(j + 1)- That is, we must show that 

HW {j + 1) - HW(j) = 1 - v j+1 . (5) 

First, if j is even then j + 1 is obtained by changing the low bit of j from a 0 to a 1. 
Thus, both sides of the above equality are 1. Next, if j is odd, then let k be the number 
of consecutive l’s in the low bits of j. Then j + 1 is obtained from j by complementing 
the lowest k + 1 bits. That is, the bits Oil... 1 in j become 100 ... 0 in j + 1. This shows 
that Vj+ 1 = k and HWij) — HW(j + 1) = k — 1, and (J5]) easily follows. This completes the 
induction and the proof. □ 

Mark Jacobson observed that v i i s H ie valuation of j\ at 2. Thus, Lemma 5.3 is 
equivalent to the statement that the valuation of j\ at 2 is j — HW(j). This latter fact is 
proved in [3j Section 4.4], using a different proof from the above. 

6 Relationship with other combinatorial Gray codes 

In this section we show that the stepping sequences R m give rise to a new family of cyclic 
Gray codes through binary m-bit integers. Also they give rise to new Gray codes through 
^-element subsets of an m-clcment set. 

We first explain the connection with Gray codes on binary m-bit integers. If S m = 
{0,1,... ,m — 1}, then subsets of S m are in bijection with the integers in the range 0 < 
i < 2 m by the map S i—>• yb gS 2*. The cardinality of S is equal to the Hamming weight of 
its corresponding integer. When we Gray-code through nested subsets of S m , we exhaust 
through all of its subsets, and consequently we exhaust through the m-bit integers. For 
example, the stepping sequence J? 4 exhausts through subsets of {0,1, 2,3} in the following 
order (see Figure [T]): 


0, {0}, {0,1}, {0,1,2}, {0,1, 2, 3}, {0,1, 3}, {0, 3}, {0,2, 3}, 
{0,2}, {2}, {2,3}, {1,2, 3}, {1,2}, {1},{1, 3}, {3}. 


This corresponds the sequence of integers (written in binary): 

0000 , 0001 , 0011 , 0111 , 1111 , 1011 , 1001 , 1101 , 0101 , 0100 , 1100 , 1110 , 0110 , 0010 , 1010 , 1000 . 

In particular, this is a Gray-code ordering of the 4-bit integers, because at each step, only 
one bit is altered. A Gray code is cyclic if the last integer in the sequence differs by only one 
bit from the first integer in the sequence. The above example is a cyclic Gray code, because 
the first element is 0000 and the last element has Hamming weight 1. 

It is not true that all stepping sequences give rise to Gray-code orderings of integers. 
For example, the stepping sequence G = [3, 2, 3, 2,1,2, 3,1,2,1,2] produces an integer of 
Hamming weight 3 next to an integer of Hamming weight 1. 

We will say that a sequence of integers is contiguous if consecutive elements differ by ±1. 
We will say that a stepping sequence for m is strongly contiguous if it is contiguous, it begins 
with m — 1 , and it ends with 1 . 

Lemma 6.1. A stepping sequence gives rise to a cyclic Gray code if and only if it is strongly 
contiguous. 

Proof. Let G = [go,..., gx] be a stepping sequence. The initial sets So, Si,..., S m correspond 
to the integers 


00•••000,00•••001,00 • ••Oil,..., 01 • • • 111, 11•• • 111. 

The next integer produced by G has Hamming weight equal to go- This integer is one-off 
from 1 • • • 1 if and only if ^7o = ^ — 1. For 0 < i < K , the integer produced by the z-th and 
(z + l)-th elements of G have Hamming weights g % and g t +i- Because the nesting property of 
the sets is always maintained, we know that the one-bits of the integer with lower Hamming 
weight is a subset of the one-bits of the integer of higher Hamming weight. Thus, these 
two integers differ by a single bit if and only if their Hamming weights differ by one, i.e., 

| gi — g l+ 1 1 = 1. Finally, the cyclic condition mandates that the last element produced be 
adjacent to 0 • • -000, i.e., the last element must have Hamming weight 1. This is equivalent 
to g K = 1. □ 

Theorem 6.2. The stepping sequence R m is strongly contiguous for each m > 2, and so it 
gives rise to a cyclic Gray-code ordering on the m-hit integers. More generally, if A and B 
are strongly contiguous stepping sequences for m — 1, then C = (A + 1) U [1, 2,..., m — 1] U H 
is a strongly contiguous stepping sequence for m. 

Proof. We first prove the second statement. Note that C is a stepping sequence for m by 
Theorem 2.1, and we just need to show that it is strongly contiguous. Since A and B are 
contiguous, so are the three pieces that make up C. Also, since A ends with a 1, we know 
A + 1 ends with a 2, which is one-off from the first term of [1, 2,..., m — 1]. Likewise, B 
begins with rn — 2, which is one-off from the last term of [1, 2,..., m — 1]. This shows that 
C is contiguous. The first term of A is rn — 2, so the first term of A + 1 is m — 1. Also, the 
last term of B is 1. Thus, the first term of C is m — 1 and the last term of C is 1. This 
shows that C is strongly contiguous. 



Taking A = B = R m - 1 , we see that if R, n -i is strongly contiguous then so is R m . Since 
i ?2 = [1] is strongly contiguous for m = 2, it follows by induction that R m is strongly 
contiguous for all m > 2. □ 

For m = 4 and m — 5, an exhaustive search shows that R m is the only strongly contiguous 
stepping sequence. However, for m = 6 there are exactly two strongly contiguous stepping 
sequences: Rq and Aq, where 


A = [5,4, 5,4, 3, 2, 3,4, 5,4, 3, 2, 3,4, 3,2, 3, 2,1,2, 3,4, 5,4, 3,4, 3, 2, 

3,4, 3, 2, 3, 2,1,2, 3,4, 5,4, 3,4, 3, 2, 3,4, 3, 2,1,2, 3,4, 3, 2,1,2,1], 


An interesting observation is that A and Rq have a long subsequence in common: 


A 

Ra 


[5,4, 5,4, 3, 2, 3,4, 5, 4, 3, 2, 3,4, 3,2, 3, 2,1,2, 3,4, 5,4, 3,4, 3, 2, 

3,4, 3, 2, 3, 2,1,2, 3,4 , 5,4, 3,4, 3, 2, 3,4, 3, 2,1, 2, 3,4, 3, 2,1, 2,1] 
[5,4,5,4,3,4,5,4, 3,4, 3, 2, 3,4, 5,4, 3, 4, 3,2, 3,4, 3,2, 3, 2,1,2, 

3,4, 5,4, 3,4, 3, 2, 3,4, 3, 2, 3,2,1,2, 3,4, 3, 2, 3, 2,1, 2, 3, 2,1, 2,1] 


Notice that the reverse of Aq is equal to its complement. However, it is not true that all 
strongly contiguous stepping sequences have that property. The first counterexamples occur 
when m = 7: (A + 1) U [1,..., 6 ] U Rq and ( Rq + 1) U [1..., 6 ] U Aq. 

Theorem 6.2 implies that if there are N strongly contiguous stepping sequences for m— 1, 
then there are at least N 2 strongly contiguous stepping sequences for m, because when 
constructing C = (A + l)ll[l,...,m-l]US we have N choices for A and N choices 
for B. Thus, if c m denotes the number of strongly contiguous stepping sequences for m, then 
c m > c m-i- By exhaustion we found c m = 1 for m < 5, and cq = 2. Thus, C 7 > 4, c$ > 16, 
and in general c m > 2 2 " 1 6 for m > 6 . For m > 7 , it is an open question whether c m = 2 2 " 1 6 . 

By Lemma 6.1, each strongly contiguous stepping sequence gives rise to a Gray code on 
m-bit integers. Thus, we have an explicit recipe for constructing c m new Gray codes on 
m-bit integers. As shown above, the number of these is at least 2 2 ™ . 

One might wonder how many stepping sequences there are that are contiguous but not 
strongly contiguous. By exhaustive computer search we found that for m < 6 , every con¬ 
tiguous stepping sequence has the property that either the sequence or its reverse is strongly 
contiguous. It is an open question whether this is true for larger m. The only partial result 
that we have in this direction is given in Lemma 6.4 below. 

Lemma 6.3. If m is even, then a contiguous stepping sequence must begin and end with an 
odd integer. 


Proof. Let m be even. Note that R m begins with the odd integer m — 1 and ends with the 
odd integer 1. Since every other step produces a set of even cardinality and every other step 
produces a set of odd cardinality, and since the total number of steps is the odd number 
2 m — m — 1, we see that if one begins on a set of even cardinality, then more even-order 
sets are produced than odd-order sets. However, R rn is known to be a stepping sequence 
that begins on an odd integer. Then the odd-order sets (excluding the initial ones) must 



outnumber the even-order sets. It follows that any contiguous stepping sequence must begin 
with an odd integer. □ 

Lemma 6.4. Let S be a contiguous stepping sequence. Then S or its reverse has the property 
that its first element is congruent to m — 1 (mod 2) and the last element is congruent to 1 
(mod 2). 

Proof. When m is even, the result follows immediately from Lemma 6.3. When m is odd, 
then 2 m — m — 1 is even, and since the parities of the elements of S alternate, we conclude 
that the first and last elements have opposite parities. By reversing the stepping sequence 
if necessary, we can assume that the first element is even and the last is odd. The result 
follows. □ 

We showed that Gray codes through nested sets give rise to a Gray code through ?n-bit 
integers, provided the associated stepping sequence is contiguous. However, it is not true 
that a Gray code through the ??r-bit integers gives rise to a solution to the nested set problem. 
For example, consider the binary reflective Gray code sequence that is defined recursively by 
G i = [0,1] and G m = ([0] x G m _\) U ([1] x where denotes the reverse of G m _\. 

For m — 1,2,3 these are 

Gi = [0,1], G 2 = [00,01,11,10], G 3 = [000,001,011,010,110,111,101,100], 

The first few corresponding sets for G 3 are 

0 

{ 0 } 

{ 0 . 1 } 

{ 1 } 

{ 1 . 2 } 

{ 0 , 1 , 2 } 

{ 0 , 2 } 

and at this point the sets 0, {1}, {0, 2}, {0,1, 2} are not nested. 

We close this section by pointing out a connection with the problem of Gray-coding 
through ^-element subsets of an m-element set. A Gray code through ^-element subsets is 
defined as an ordering of the ^-element subsets such that two consecutive subsets differ by 
just one element, i.e., they have exactly k — 1 elements in common. This ordering is cyclic 
if the first and last Axelement subsets also differ by just one element. A Gray code through 
nested sets results in an ordered sequence of all subsets of S m , that contains each subset 
exactly once. We claim that by restricting to the A:-element subsets, we obtain a Gray code 
through /e-element subsets. To see this, recall that a new A-element set is produced when 
Sk is replaced by a new set S To obtain Sl from Sk, the unique element of Sk \ Sfc_i is 
replaced by the unique element of Sk+i \ Sk■ Thus, the two sets differ by a single element, 
and this demonstrates that a Gray code through nested sets induces a Gray code through 
/{{-element subsets. 



For example, the stepping sequence i? 6 induces the following order on 2-element subsets 
of {0,1, 2, 3,4, 5}: 


{0,1}, {0, 5}, {0,4}, {0, 3}, {0, 2}, {2, 3}, {2, 5}, {2,4}, 

{1,2}, {1,4}, {1,3}, {1,5}, {3, 5}, {4,5}, {3,4}. 

Note however that the first and last sets are not close (they are disjoint), so this combinatorial 
Gray code through order-2 subsets is not cyclic. For methods of Gray-coding through k- 
elcment subsets of an order-m set see [21 lIUIl- 
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